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001E0 INTRODUCTION 

00200 

00310 l shall attempt to explain what Super Foonly is all about to those 

00400 having only a knowledge of the DECsystem-10 (alias the PDP-10). This 

00500 description may well Include some DEC-centered b!as» since Its 

00600 author's salary is being paid fc>y the aforementioned computer 

00700 manufacturer • 

00800 

00900 Super Foonly Is touted by Its designers (primarily Messrs, John 

01000 Holloway, Ed McGuire, Ted Panofsky# Phil Petit. Dave Poole and Fred 

01120 Wright Calphabet'oal Iy3) as a replacement for the KA10 whloh runs ten 

01200 times as fast, It Is being designed at the Stanford Artificial 

013P0 Intelligence Laboratoryi funded by the Advanced Research Projeots 

01400 Agency of the DOD, As of this time (25 May 72) the design Is not yet 

015^0 oomplete, The designers estimate 2 to 3 months more design effort* 

01600 but I would double that estimate without much fear of being too 

017^0 short, 
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00120 GENERAL DESIGN APPROACH 

00150 

00200 In order to achieve the desired speed-up factori several new 

003e0 approaches we r e taken In the design. The primary departure from 

00400 traditional DEC designs is the Inclusion of a 2048 word h t oh speed 

00500 "cache" memory in the Processor, Stanford simulations Indicate that 

00620 In excess of 95% of all memory references will be found In the cache, 

007e0 For those unfamiliar with caohe memories, I will explain them in more 

00800 detail late r , 
00900 

01000 Super Fooniy consists of several "boxes"* all physically located 

01100 within two fairly standard DEC 30" cabinets, The boxes include the 

01200 «i" box which prepares Instructions by doing the effective address 

01300 calculation and doing the decoding necessary to determine the operand 

01400 requirements and fetching those operands? the "E" box which does the 

01500 exeoutlon of the computations} the "M" box which holds the cache and 

01600 memory mapping hardware? the "I/O" connection box which filters a|| 

01700 l/o memory operations; and the "IOT" box which performs I/O bus 

01800 operations! 

01900 

02000 The machine operates in a pipe-line fashion* with all boxes operating 

02100 concurrent lyi generally performing their various functions on a 

02200 seauence of instructions, Through thjs means. It Is expected that 

02300 simple Instructions oan be done In one or two machine eyoles of 100 

02400 nan for an average Instruction time of ( say# 200 ns, compared to 

02500 the >2 microseconds of the KA10, 

02600 

02700 

02800 

02900 

03000 

03100 

032E0 

03300 

03400 

03500 

03600 

03700 

03800 
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gn|g I BOX DESCRIPTION 

003£0 The I box prepares Instructions for execution, It contains the 

004j?0 Program Counter {PC) and the accumu lators. There |g only one block of 

00520 AC's (like the KAl0)i however* there are 3 copies of them, There Is 

00600 also an Advanced PC which supplies the addresses for the two possible 

00700 prefetohed Instructions, 
008^0 

00900 in operation, the I box has the current Instruction in a register 

01000 called the UR, The effective address Is calculated using one copy 

01100 of the AG's. whloh always supplies the XR, If neededi using an adder 

012£0 reserved for this purpose, Another copy of the AC's decodes the "AC" 

01300 field, while the third copy supplies aC+1 or e(E> as reaulred by the 

01400 Instruction, The Instruction part Is sent to the I box control 

015§0 memory* which supplies what DEC calls the fetch and store cycle 

01600 switches, as well as a starting address In the E box control memory, 

01700 

01800 Should the Instruction require memory data, a reauest Is Issued to 

01900 the M box for the word required, In any case the data Is presented 

02020 to the e box for Its use as soon as It oan take It, , The I box Is 

02100 usually freed to go about its business for the next Instruction as 

02200 soon as the data Is Passed to the E box, This frequently means that 

02300 the I box will have the next Instruction prepared In 1 or 2 cycles, 

02400 whloh overlaps with the typically 1 cycle taken by the E box to do 

02500 the execution, 
02600 

027g0 There are, or course, complications which make all of the above 

028g>0 decidedly non-trlvlal, First there Is the matter that the E box does 

029^0 not return AC results until the end of an Instruction execution, 

03000 Should an Instruction use an AC being modified by the previous 

031®0 Instruction, It would normally take an extra cycl| to write that AC, 

03200 and make It available again, so numerous comoarltors exist to detect 

033Z0 these cases, and provide dlreot "loop-around" paths*, where possible, 

03400 for the E box to use its outputs directly, Another problem Involves 

03500 conflicts between words being stored by the E box In memory, and 

03600 either Instructions or data being ore-fetched, Much hairy lode 

03700 exists to detect and provide for these cases, 
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00120 
002*0 
003g0 
004^0 
00570 
00600 
007£0 
008Z0 
00900 

01000 
01100 

01200 
01300 
01400 
01500 
01600 
0l7g0 

01750 
01800 
01900 
02000 
02100 
02200 
02300 
02^00 
02500 
02600 
02700 



E BOX DESCRIPTION 



The E box, as mentioned above, actual 
It has a 512 word by 144 bit contro 



y performs the calculations, 
memory which aotually oontrois 



Its operations, however bits of the Instruction register are used to 
distinguish between similar Instructions. Thus, although It Is 
claimed that arbitrary new Instructions oan be microprogrammed, much 
of the logic Is closely tied to existing Instructions, 
Microprogramming will likely be painful even to a system wizard. 

The E box features suoh goodies as a complete 72 bit shift matrix* 
allowing all shifts to be done in one cycle; a normal I ze r which can 
normalize floating point numbers in 1 or 2 cyclesi an *dde r with two 
complete sets of ALUs* such that one set has permanently true earrys 
and the other has permanently false earrys, and the next mixer stegfi 
selects between them In 4 bit groups according to the aotual earrys) 
and a 4 bit at a time multiplication algorithm, 

The main E box data oath circulates thru 3 sets of latches, although 
there are t'mes when the cycle Is thru only two latch registers. The 
way this works is that while the data is being held in the first sej 
of latchesi the second set Is opened to receive data, The second set 
Is then locked (or latched up, if you prefer), Its Inputs are 
deselected by Inverting the hold signal, The first register Is then 
unlatched, and set to look at a funotlon of the second, After enough 
time for all data to arrive and circulate thru the latches has been 
allowed, the first latches are locked up, followed by deselecting the 
lnouts» and the cycle repeats. 
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00l£0 M BOX DESCRIPTION 

00150 

20220 Th« M box contains the cache, the map and the Interface to oore 

003.J0 memory, A|| memory requests In the system (including direct to memory 

004?0 devices) a r e filtered thru the M box, as this Is the only way to 

00500 avoid sta|e data from accumulating In the cache, 

00600 

007^0 The oache consists of 4 blocks* each containing 128 quadruple data 

00800 words and 128 address words for a total of 2048 data words. Locations 

00900 In the 128 word dimension are binary selected from bits 27 thru 33 of 

010£0 the address. Words in the quad-word s I Ice are selected by bits 34 

01100 and 35, Thus, independent of any high order address bits, 4 possible 

01200 candidate words and their associated address words are selected from 

01300 the oache, one from each block, These address words contain the 

01400 physical page number (bits 14 thru 27) of the quad of data words, The 

01500 physical page number from the map Is then simultaneously compared to 

01620 the 4 selected address words to determine whether any of them matohi 

01700 and if so, the data from that oarttoular word Is selected, 
01800 

0l9g0 The map Is reminiscent of the BBN map, in that It has hardware 

02000 provisions for indirect and shared pages, and bits which tell whether 

02100 a particular page has yet been referenced or written into. The map 

02200 takes 14 address bits In (bits 13 - 26) and gives 13 physical address 

02300 bits outi whloh are the physical page number mentioned above. The 

02400 high order 5 virtual address bits are provided by the "extend mode" 

02500 feature, which causes Indirect and Indexed references to use bits 13 

02600 - 35 as add r e s s bits, and bits - 4 as the Index and indirect fields 

02700 of Indirect words, This mode Is under user contro j f ., and can also be 

02800 entered for a single instruction by a special execute instruction, 
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02920 
02950 
03000 

031S0 
03200 
03300 
03420 
03500 
03550 
03620 
03700 
03800 
03920 
04000 

04100 



I/O and IOT Boxes 

Words going between direct memory access channels and memory are also 
checked against the cache to insure that the cache copy of any given 
word does not get stale, The memory cycles necessary for I/O words 
are still taken from core thru their own memory busses, since only 
one caohe cycle Is taken for every 4 words, even a 2 microsecond per 
word transfer device uses only one cache cycle out of 90. The buffers 
and Interfaces for this purpose constitute the I/O box, 




